Direct word graph rescoring using a* search and RNNLM
نویسندگان
چکیده
The usage of Recurrent Neural Network Language Models (RNNLMs) has allowed reaching significant improvements in Automatic Speech Recognition (ASR) tasks. However, to take advantage of their capability for considering long histories, they are usually used to rescore the N-best lists (i.e. it is in practice not possible to use them directly during acoustic trellis search). We propose in this paper a novel method for rescoring directly the hypotheses contained in the word graphs, which are generated in the first pass of ASR decoding. The method, based on the A* stack search, rescores the partial theories of the stack with a log-linear combination of the acoustic model score and a linear combination of multiple language model scores (including RNNLM). We compared, on an ASR task consisting of the automatic transcription of English weather news, the A* based approach with N-best rescoring and iterative confusion network decoding. Using the proposed method, we measured a relative word error rate improvement of about 6%, on the given task, with respect to using the baseline system. The latter improvement is comparable with that obtained with N-best list based rescoring method.
منابع مشابه
Improving WFST-based G2P Conversion with Alignment Constraints and RNNLM N-best Rescoring
This work introduces a modified WFST-based multiple to multiple EM-driven alignment algorithm for Grapheme-to-Phoneme (G2P) conversion, and preliminary experimental results applying a Recurrent Neural Network Language Model (RNNLM) as an Nbest rescoring mechanism for G2P conversion. The alignment algorithm leverages the WFST framework and introduces several simple structural constraints which y...
متن کاملExploiting the succeeding words in recurrent neural network language models
In automatic speech recognition, conventional language models recognize the current word using only information from preceding words. Recently, Recurrent Neural Network Language Models (RNNLMs) have drawn increased research attention because of their ability to outperform conventional n-gram language models. The superiority of RNNLMs is based in their ability to capture long-distance word depen...
متن کاملRescoring-Aware Beam Search for Reduced Search Errors in Contextual Automatic Speech Recognition
Using context in automatic speech recognition allows the recognition system to dynamically task-adapt and bring gains to a broad variety of use-cases. An important mechanism of contextinclusion is on-the-fly rescoring of hypotheses with contextual language model content available only in real-time. In systems where rescoring occurs on the lattice during its construction as part of beam search d...
متن کاملWord Graph Rescoring Using Con dence Measures
This paper presents a novel approach to using con dence scores for word graph rescoring. For each word in the system's vocabulary, we computed the probability that the observation is correct given its acoustic score. Afterwards, we used these probabilities for rescoring word graphs outputted by the recognizer. We will present some implementation details as well as accuracy improvements obtained...
متن کاملWord graph rescoring using confidence measures
This paper presents a novel approach to using con dence scores for word graph rescoring. For each word in the system's vocabulary, we computed the probability that the observation is correct given its acoustic score. Afterwards, we used these probabilities for rescoring word graphs outputted by the recognizer. We will present some implementation details as well as accuracy improvements obtained...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014